Parallel and distributed computing for Big Data applications
Similar resources
Parallel and distributed computing for data mining
Similar scenarios will occur in other areas: we will see large numbers of radiological images generated in hospitals and immense product and customer databases as the Internet and e-commerce continue to expand [1]. Exploring useful information from such data will require efficient parallel algorithms running on high-performance computing systems with powerful parallel I/O capabilities. Without suc...
Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming
The objective of this study is to verify the importance of cloud computing service capabilities in managing and analyzing big data in business organizations, because the rapid development in the use of information technology in general, and network technology in particular, has led many organizations to make their applications available for use via electronic platforms hos...
Distributed and parallel time series feature extraction for industrial big data applications
The all-relevant problem of feature selection is the identification of all strongly and weakly relevant attributes. This problem is especially hard to solve for time series classification and regression in industrial applications such as predictive maintenance or production line optimization, for which each label or regression target is associated with several time series and meta-information s...
Big data mining with parallel computing: A comparison of distributed and MapReduce methodologies
Mining with big data, or big data mining, has become an active research area. It is very difficult for current methodologies and data mining software tools running on a single personal computer to deal efficiently with very large datasets. Parallel and cloud computing platforms are considered a better solution for big data mining. The concept of parallel computing is based on dividing a large prob...
Distributed and Big Data Storage Management in Grid Computing
Big data storage management is one of the most challenging issues for Grid computing environments, since a large number of data-intensive applications frequently involve a high degree of data access locality. Grid applications typically deal with large amounts of data. In traditional approaches, high-performance computing consists of dedicated servers that are used for data storage and data replicatio...
Journal
Journal title: Concurrency and Computation: Practice and Experience
Year: 2016
ISSN: 1532-0626
DOI: 10.1002/cpe.3813